Validity Measure of Cluster Based On the Intra-Cluster and Inter-Cluster Distance
نویسنده
چکیده
The k-means method has been shown to be effective in producing good clustering results for many practical applications. However, a direct algorithm of k-means method requires time proportional to the product of number of patterns and number of clusters per iteration. This is computationally very expensive especially for large datasets. The main disadvantage of the k-means algorithm is that the number of clusters, K, must be supplied as a parameter. In this paper we present a simple validity measure based on the intra-cluster and inter-cluster distance measures which allows the number of clusters to be determined automatically. The basic procedure involves producing all the segmented dataset for 2 clusters up to Kmaxclusters, where Kmaxrepresents an upper limit on the number of clusters. Then our validity measure is calculated to determine which is the best clustering by finding the minimum value for our measure.
منابع مشابه
Biochemical and bioassay studies on the influence of different organic manures on the growth of Mulberry Variety V1 and silkworm, Bombyx mori Linn.
The genetic variation and diversity among fifty-eight polyvoltine silkworm genotypes was estimated by using ten economic traits. The results revealed that the single shell weight showed higher genetic variation such as PCV% (17.20%), GCV% (12.93%), and heritability (56.5%) followed by single cocoon weight, shell ratio and matured larval weight. The D2 (Mahalonobis? distance) statistics reveal...
متن کاملAn analysis of genetic variation and divergence in Indian tropical polyvoltine silkworm (Bombyx mori L.) genotypes
The genetic variation and diversity among fifty-eight polyvoltine silkworm genotypes was estimated by using ten economic traits. The results revealed that the single shell weight showed higher genetic variation such as PCV% (17.20%), GCV% (12.93%), and heritability (56.5%) followed by single cocoon weight, shell ratio and matured larval weight. The D2 (Mahalonobis? distance) statistics reveal...
متن کاملA kernelized fuzzy c-means algorithm for automatic magnetic resonance image segmentation
In this paper, we present alternative Kernelized FCM algorithms (KFCM) that could improve magnetic resonance imaging (MRI) segmentation. Then we implement the proposed KFCM method with considering some spatial constraints on the objective function. The algorithms incorporate spatial information into the membership function and the validity procedure for clustering. We use the intra-cluster dist...
متن کاملارزیابی روشهای گروهبندی ژنوتیپ های کلزا با استفاده از تجزیه تابع تشخیص خطی فیشر
Discrimination function analysis is a method of multivariate analysis that can be used for determination of validity in cluster analysis. In this study, Fisher’s linear discrimination function analysis was used to evaluate the results from different methods of cluster analysis (i.e. different distance criteria, different cluster procedures, standardized and un-standardized data). Furthermore, H...
متن کاملارزیابی روشهای گروهبندی ژنوتیپ های کلزا با استفاده از تجزیه تابع تشخیص خطی فیشر
Discrimination function analysis is a method of multivariate analysis that can be used for determination of validity in cluster analysis. In this study, Fisher’s linear discrimination function analysis was used to evaluate the results from different methods of cluster analysis (i.e. different distance criteria, different cluster procedures, standardized and un-standardized data). Furthermore, H...
متن کامل